Pitch-based Emphasis Detection for Characterization of Meeting Recordings

Authors

  • Lyndon S. Kennedy
  • Daniel P.W. Ellis
Abstract

The automatic extraction of key utterances in spoken data has emerged as an interesting and difficult topic in automatic speech recognition. “Emphasis” or “excitement” may be a useful identifier for these utterances of interest. In this paper, we undertake the task of reliably and automatically identifying emphasized or excited utterances in natural speech in a meeting setting. We start by endeavoring to establish reliable ground truth emphasis labels by using several hand-labelers. The results show that human listeners can reliably identify emphasized utterances in meeting recordings. We then build an automatic emphasis detection system, which uses normalized pitch as its only acoustic predictor. The results show that this pitch-based emphasis detection scheme can distinguish between non-emphasized and emphasized utterances with an accuracy of 92% when ambiguous cases are excluded, a rate comparable to human interlabeler agreement.
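
The abstract says only that normalized pitch is the sole acoustic predictor; it does not specify the normalization or the decision rule. As a rough illustration, the following minimal sketch assumes per-speaker z-score normalization of voiced pitch (F0) frames and a fixed threshold on the mean normalized pitch of an utterance. The function names, the convention that 0 marks an unvoiced frame, and the threshold value of 1.0 are all hypothetical choices made for this example, not the authors' method.

```python
from statistics import mean, pstdev


def speaker_stats(all_f0):
    """Mean and standard deviation of a speaker's voiced F0 frames (Hz).

    Assumption: unvoiced frames are encoded as 0 and are skipped.
    """
    voiced = [f for f in all_f0 if f > 0]
    if not voiced:
        raise ValueError("no voiced frames for this speaker")
    return mean(voiced), pstdev(voiced)


def emphasis_score(utterance_f0, speaker_mean, speaker_std):
    """Mean z-scored pitch over the voiced frames of one utterance."""
    if speaker_std == 0:
        raise ValueError("speaker pitch has zero variance")
    z = [(f - speaker_mean) / speaker_std for f in utterance_f0 if f > 0]
    return mean(z) if z else 0.0


def is_emphasized(utterance_f0, speaker_mean, speaker_std, threshold=1.0):
    """Hypothetical decision rule: flag utterances whose score exceeds threshold."""
    return emphasis_score(utterance_f0, speaker_mean, speaker_std) > threshold


# Toy data: one speaker's pitch track, then two utterances from that speaker.
mu, sigma = speaker_stats([100, 0, 110, 120, 0, 130, 140])
high = is_emphasized([150, 160, 0], mu, sigma)
neutral = is_emphasized([115, 120, 125], mu, sigma)
```

Normalizing against each speaker's own pitch statistics keeps a naturally high-pitched speaker from being flagged as emphatic all the time, which is one plausible reason to normalize pitch in a multi-speaker meeting setting.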

Similar resources

Pitch-based emphasis detection for segmenting speech recordings

This paper describes a technique to automatically locate emphasized segments of a speech recording based on pitch. These salient portions can be used in a variety of applications, but were originally designed to be used in an interactive system that enables high-speed skimming and browsing of speech recordings. Previous techniques to detect emphasis have used Hidden Markov Models; emphasized re...

Musical Offset Detection of Pitched Instruments: The Case of Violin

Musical offset detection is an integral part of a music signal processing system that requires complete characterization of note events. However, unlike onset detection, offset detection has seldom been the subject of an in-depth study in the music information retrieval community, possibly because of the ambiguity involved in the determination of offset times in music. This paper presents a pre...

Multi-Pitch Detection and Voice Assignment for A Cappella Recordings of Multiple Singers

This paper presents a multi-pitch detection and voice assignment method applied to audio recordings containing a cappella performances with multiple singers. A novel approach combining an acoustic model for multi-pitch detection and a music language model for voice separation and assignment is proposed. The acoustic model is a spectrogram factorization process based on Probabilistic Latent Comp...

An Automatic Pitch Detection Method Based on Multi-feature for Mandarin Speech

There are many traditional pitch detection methods, but most of them do not perform well across different speakers, applications and environmental conditions. For this reason, a multi-feature pitch detection method is proposed. First, the speech signal is pre-filtered. Second, the pre-filtered speech signal is segmented into syllables. Finally, the pitch period is obtained by wa...

Human Echolocation in Static Situations: Auditory Models of Detection Thresholds for Distance, Pitch, Loudness and Timbre

We investigated, by using auditory models, how three perceptual parameters, loudness, pitch and sharpness, determine human echolocation. We used acoustic recordings from two previous studies, both from stationary situations, and their resulting perceptual data as input to our analysis. An initial analysis was on the room acoustics of the recordings. The parameters of interest were sound pressur...

Publication date: 2003